Disambiguation Strategies for Data-Oriented Translation

نویسندگان

  • Mary Hearne
  • Andy Way
چکیده

The Data-Oriented Translation (DOT) model – originally proposed in (Poutsma, 1998, 2003) and based on Data-Oriented Parsing (DOP) (e.g. (Bod, Scha, & Sima’an, 2003)) – is best described as a hybrid model of translation as it combines examples, linguistic information and a statistical translation model. Although theoretically interesting, it inherits the computational complexity associated with DOP. In this paper, we focus on one computational challenge for this model: efficiently selecting the ‘best’ translation to output. We present four different disambiguation strategies in terms of how they are implemented in our DOT system, along with experiments which investigate how they compare in terms of accuracy and efficiency.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring the Translator\'s Solutions to the Translation of Conversational Implicatures from English into Persian: the Case of Tolkien\'s the Lord of the Rings

The present study aimed to examine the translatorchr('39')s solutions to the translation of conversational implicatures from English into Persian. To do so, 120 conversational implicatures were extracted from the novel the Lord of the Rings (Tolkien, 1954) and classified based on Gricechr('39')s (1975) categorization of Maxims, including quality, quantity, relevance, and manner. Mur Duenaschr('...

متن کامل

Data-Oriented Translation

In this article, we present a statistical approach to machine translation that is based on Data-Oriented Parsing: Data-Oriented Translation (DOT). In DOT, we use linked subtree pairs for creating a derivation of a source sentence. Each linked subtree pair has a certain probability, and consists of two trees: one in the source language and one in the target language. When a derivation has been f...

متن کامل

Translating Political Texts with and without the Defined Skopos: The Case of Iranian Translators

The aim of the present study was to investigate the importance of skopos in the translation of political texts from English into Persian. To do this, 30 Iranian translators were conveniently selected and equally divided into the in-house and freelance translators. They were asked to translate a translation test encompassing 10 short political texts that were extracted from English news websites...

متن کامل

Multiple Strategies for Automatic Disambiguation in Technical Translation Multiple Strategies for Automatic Disambiguation in Technical Translation

Author(s) hidden for anonymous review Institute also hidden Address also hidden (probably two lines) Email also hidden Abstract The use of knowledge-based machine translation with controlled technical text can produce high-quality translations. However, building and maintaining knowledge bases can require signiicant time and eeort, since they typically involve hand-coding of semantic preference...

متن کامل

Multiple Strategies for Automatic Disambiguation in Technical Translation

The use of knowledge-based machine translation with controlled technical text can produce high-quality translations. However, building and maintaining knowledge bases can require significant time and effort, since they typically involve handcoding of semantic preferences. When a system can't disambiguate based on semantic preferences, it can initiate interactive disambiguation with the author t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006